Semantic disambiguation of taxonomies

نویسندگان

  • David Sánchez
  • Antonio Moreno
چکیده

Polysemy is one of the most difficult problems when dealing with natural language resources. Consequently, automated ontology learning from textual sources (such as web resources) is hampered by the inherent ambiguity of human language. In order to tackle this problem, this paper presents an automatic and unsupervised method for disambiguating taxonomies (the key component of a final ontology). It takes into consideration the amount of resources available in the Web as the base for inferring information distribution and semantics. It uses co-occurrence analysis and clustering techniques in order to group those taxonomical concepts that belong to the same “sense”. The final results are automatically evaluated against WordNet synsets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Knowledge-based sense disambiguation (almost) for all structures

Structural disambiguation is acknowledged as a very real and frequent problem for many semantic-aware applications. In this paper, we propose a unified answer to sense disambiguation on a large variety of structures both at data and metadata level such as relational schemas, XML data and schemas, taxonomies, and ontologies. Our knowledge-based approach achieves general applicability by converti...

متن کامل

1 Rerendering Top Ontologies

Data mining and information extraction rely on a number of natural language tasks that require semantic typing; that is, the ability of an application to accurately determine the conceptual categories of syntactic constituents. Semantic typing serves tasks such as relation extraction by improving anaphora resolution and entity identification. Domain-specific semantic typing also benefits statis...

متن کامل

Taxonomies with Lattice Algebras

In this paper taxonomies are considered to be mathematical lattices. This gives us a \two-edged sword" for describing taxonomies. On the one-hand, we have a concise algebra suitable for speciication of taxonomies, and on the other an order theoretic representation suitable for visual presentation of ontologies. A language based on algebraic lattices is described and given a model-theoretic sema...

متن کامل

Making Explicit the Hidden Semantics of Hierarchical Classifications

Hierarchical classifications are concept hierarchies used to organize large amounts of documents. File systems, products’ taxonomies for the market place and the directories provided by Web portals are common examples of hierarchical classifications. As semi-structured knowledge sources, hierarchical classifications have peculiar features: they differ both from plain texts since they are based ...

متن کامل

A semantic approach for extracting domain taxonomies from text

In this paper we present a framework for the automatic building of a domain taxonomy from text corpora, called Automatic Taxonomy Construction from Text (ATCT). This framework comprises four steps. First, terms are extracted from a corpus of documents. From these extracted terms the ones that are most relevant for a specific domain are selected using a filtering approach in the second step. Thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007